THE STRUCTURE OF STRONGLY STATIONARY SYSTEMS 



NIKOS FRANTZIKINAKIS 

Abstract. Motivated by a problem in crgodic Ramsey theory Furstenberg and Katz- 
nelson introduced the notion of strong stationarity, showing that certain recurrence 
properties hold for arbitrary measure preserving systems if they are valid for strongly 
stationary ones. We construct some new examples and prove a structure theorem for 
strongly stationary systems. The building blocks are Bernoulli systems and rotations 
on nilmanifolds. 



1. Introduction 

1.1. Historical background. In 1975 Szemeredi proved the following long standing 
conjecture of Erdos and Turan: 

Theorem 1.1 (Szemeredi). Let A be a subset of the integers with positive upper density. 
Then A contains arbitrarily long arithmetic progressions. 

Szemeredi's proof was combinatorial in nature and intricate. In 1977 Furstenberg 
( |Fu77j ) gave an entirely different proof using ergodic theory. He showed that Szemeredi's 
theorem is equivalent to a statement about multiple recurrence of measure preserving 
systems and then proved the ergodic version: 

Theorem 1.2 (Furstenberg). Let (X,B, /i, T) be a finite measure preserving system 
and A e B be a set with positive measure. Then for every k G N, there exists nGN such 
that 

fi{A n T- n A n ■ ■ ■ n T~ nk A) > o. 

Furstenberg's proof launched the field of ergodic Ramsey theory, where problems 
in combinatorics are translated to recurrence related statements of measure preserv- 
ing systems and then proved using ergodic theory. Using this approach Furstenberg and 
Katznelson, and more recently Bergelson and Leibman (among others) established several 
ergodic theoretic results whose combinatorial implications are not currently attainable 
by any other methods. This includes a multidimensional and a polynomial extension of 
Szemeredi's theorem ( |FK79j . I3L96 ), and the density version of Hales- Jewett theorem 
( [FK9T] ) . the "master" theorem that contains several major results in the field as special 
cases. 

The notion of strong stationarity (Definition IH.lj) was introduced in the paper of 
Furstenberg and Katznelson ( |FK91j ) in proving the density version of Hales- Jewett 
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theorem. An important result established in the same paper is that an arbitrary sta- 
tionary process "majorizes" a strongly stationary one (Section 13.2)1 . From this it follows 
that several recurrence properties are valid for arbitrary measure preserving systems if 
they are valid for strongly stationary ones. In particular, it turns out to be sufficient to 
verify Theorem 11.21 for the specific case of a strongly stationary system. This motivated 
the problem of determining the structure of strongly stationary systems. 

In |Je97j Jenvey proved that every ergodic strongly stationary system is necessarily 
Bernoulli. Unfortunately, not every ergodic system majorizes an ergodic strongly station- 
ary one, and the ergodic components of a strongly stationary system are not necessarily 
strongly stationary. In fact nonergodic strongly stationary systems can have completely 
different structure than the ergodic ones; there exist several distal examples that one has 
to identify. 

We give a structure theorem for strongly stationary systems (Theorems l(i . (il and 16 . 9)1 : 

Main Theorem, (i) Almost every ergodic component of a strongly stationary system is 
isomorphic to the direct product of a Bernoulli system and a totally ergodic pro-nilsystem 
(defined in Section WT\) . 

(ii) An extremal strongly stationary system (Definition \4 ■ 1]) is isomorphic to the direct 
product of a Bernoulli system and a strongly stationary system associated to some pro- 
nilmanifold (defined in Section \6J\/ . 

Moreover, we construct new examples of strongly stationary systems (Section 16.4) 
examples (iv) and (v)). 

1.2. Format of the paper. After reviewing some preliminary notions and results in 
Section 2 we define strong stationarity in Section 3. We prove that an arbitrary stationary 
process majorizes a strongly stationary one and give the basic examples of strongly 
stationary systems. 

The general strongly stationary system is an integral of extremal ones. In Section 
4 we give necessary and sufficient conditions for extremality and prove a homogeneity 
property for extremal strongly stationary systems. 

In Section 5 we prove that almost every ergodic component of a strongly stationary 
system is isomorphic to the direct product of a distal system and a Bernoulli system. 
Moreover, we show that the distal factor of a strongly stationary system is strongly 
stationary and coincides with the characteristic factor of the system (defined in Section 
15.4)1 . This reduces our problem to determining the structure of distal strongly stationary 
systems. 

Finally, in Section 6 we obtain a structure theorem for distal strongly stationary 
systems. Using results from Section 5, in conjunction with a recent result of Host and 
Kra ([HK03J), we show that their ergodic components are pro-nilsystems. Moreover, 
we construct new examples of distal strongly stationary systems with nonaffine ergodic 
components. This new set of examples allows us to give a complete classification. 
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2. Preliminaries 

To facilitate the reading, we establish our notation and review some basic results that 
are used in the sequel. We refer the reader to [Fu81 , [Pe89], and |Wa82j for more details. 

2.1. Measure preserving systems. A measure preserving system (or just system) is a 
measure space (X, B, /i) together with a measurable measure preserving transformation 
T on it. Throughout the discussion we assume that all measure spaces are Lebesgue. 
When there is no danger of confusion we use the bold symbol X to denote the system 
(X, B, fi, T). We also use the bold symbol T to denote the operator T: L°°(X) -> L°°(X) 
defined by (Tf)(x) = f(T(x)). 

Let X = (X, B, /i, T) be an invertible measure preserving system and Bo be a sub-cr- 
algebra of B. By \J™ n T l Bo we denote the a-algebra spanned by sets of the form T l B 
where B £ Bo, —n < i < m. We say that £>o is T-generating if V/^°oo T*Bo = B up to null 
sets. If T is an algebra of bounded immeasurable complex valued functions, we denote 
by B(!F) the sub-cx-algebra of jF-measurable sets, that is, the cx-algebra generated by sets 
of the form f _1 (A), where / £ T and A C C is open. We say that T is T-generating if 
B{J-) is T-generating. 

2.2. Furstenberg's structure theorem. Let X be a measure preserving system, Y = 
(Y,S,u,R) be a factor of X, and \i = J \i y du(y) be the disintegration of /x over Y. A 
function / £ T 2 (X) is compact relative to the factor Y if for every e > there exist 
functions g±, . . . , g m £ T 2 (X) such that mhii< s < m ||T 1 / — <?s||z2(„ ) < e for every i £ N 
and for z/-a.e. y £ Y. An extension X of Y is compact if the set of compact functions 
relative to Y is dense in L 2 (X). 

Starting with the factor of T-invariant functions Do, we define inductively Dj + i to 
be the maximal compact extension of D,. More precisely, we consider the subalgebra 
generated by the compact functions relative to Dj and we define D i+1 to be the factor 
determined by it. We call the factor the k-step distal factor of the system X. The 
maximal factor that can be exhausted by a transfinite number of compact extensions is 
called the distal factor and is denoted by D. 

We say that X is a relatively ergodic extension of Y, if every T-invariant function on 
X is /i-a.e. a function on Y (that is ^-measurable). 

Let Xx, X 2 be two extensions of Y. The fiber product space is defined as 

X 1 x y X 2 = {(xi,x 2 ) £ X 1 xX 2 : ni(xi) = n 2 (x 2 )} , 

where nf. Xi — > Y the factor map. By B' we denote the restriction of B\ x B 2 on 
Xi x Y X 2 , and by // the measure defined by the disintegration (over Y) \il y = ^ x /x 2 , 
where /ii = / //* du(y) and ji 2 = f fly dv{y). If Xx = X 2 = X, and Y is the factor 
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determined by the action of T on a sub-cx-algebra 8 of B, then 

J f(x 1 )f(x 2 ) d/i 1 = J E(/i|£)E(/ 2 |£) dfi 

where E(/|£) denotes the conditional expectation of / given £ . We can check that 
XxXyX 2 = (Xi XyX 2 , B', /jf, Ti x T 2 ) is a measure preserving system that extends Y. 
We say that X is a relative weak mixing extension of Y if XxyX is a relatively ergodic 
extension of Y. 

Furstenberg ( |Fu81j ) proved the following structure theorem: 

Theorem 2.1 (Furstenberg). Every measure preserving system is a relative weak mix- 
ing extension of its distal factor. 

We note that the distal factor is the smallest factor with respect to which the system is 
relative weak mixing. The following result was proved in |Fu77j for ergodic systems and 
is needed in the sequel. The argument given there also works for nonergodic systems. 

Lemma 2.2 (Furstenberg). The invariant functions of X Xy X belong to the closed 
subspace spanned by functions of the form /i (^1)72(^2) where f\, f 2 are functions compact 
relative to Y. 

2.3. Stationary processes. A A-valued stochastic process is a sequence of measurable 
functions (random variables) {/i}i e z defined on a probability space (X, B, //) with values 
in a compact metric space A (the state space). 

The finite dimensional statistics of a stationary process {fi}i^z is the collection of all 
measurements /i( f\ i= _ k {fi G AA-), where fceN and A4 C A are open. 

A stochastic process is stationary if its finite dimensional statistics are invariant under 
translations of the time parameter, that is, n(f) i= _ k {fi+ r £ Ai}) — A t (Di=-A;{/j e ^«}) 
for all k, r G N and open sets A4 C A. 

Two stationary processes {fi}iez and {gi}iez (suppose that //, v are the corresponding 
underlying measures) are equivalent if they have the same finite dimensional statistics, 
that is, /i( f) i= _ k {fi E Ai}) = v{ f)i=-k{9i e ^}) f° r ever Y k E N and open sets A4 C A. 

2.4. Sequence space representations. Let A be a compact metric space. The se- 
quence space A z equipped with the product topology is again a compact metric space. 
We denote by Xi the i-th coordinate of a point x 6 A z . The Borel a-algebra A z is gen- 
erated by the finite dimensional rectangles n-fc{ x: Xi ^ wnere each Ai C A is open. 
A probability measure a defined on the completion of A z (which we denote again by 
A z ) is stationary if cr(f] i= _ k {x: Xi G Ai}) = cr(f] i= _ k {x: Xi +r G AA-) for every k G N, 
r G Z and open sets C A. Having fixed the space A we denote by AA. the set of all 
probability measures and by M. s the space of all stationary measures on the sequence 
space A z . Both M. and M. s endowed with the weak-star (w*) topology are compact 
convex spaces. 
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Let a be a stationary measure on the sequence space A z . The shift operator S, defined 
by (Sx)k = Xk+i, is continuous and the system A z = (A z , A z , a, S) is measure preserving. 
We call it the sequence space system determined by the stationary measure a. Moreover, 
we call the stationary process {xi} ie z the sequence space process determined by a. To 
ease notation we denote by X{ both the i-th coordinate of a point x and the function 
that maps each point to its i-th coordinate. 

Let {fi}i£z be a A-valued stationary process. If there exists a stationary measure a 
on A z that makes the processes {fi}i^z and {xi} i€ z equivalent we say that the second 
process is the sequence space representation of the first. The next classical result ( |Br92j . 
page 107) is an easy consequence of Kolmogorov's extension theorem. 

Proposition 2.3. Every A-valued stationary process {fi}iez has a A z sequence space 
representation. 

Let X be a measure preserving system and T be a T-generating subalgebra. Suppose 
that X is isomorphic to a sequence space system I z = (I z , B z , a, S), where / = [0, 1], and 
the isomorphism 0: X — > I z maps sets in i3(jF) to Xo-measurable sets. Then we say that 
the system I z is the sequence space representation o/X with respect to the subalgebra T . 
The next proposition is a variation of a classical result: 

Proposition 2.4. Every invertible measure preserving system X has a sequence space 
representation with respect to any T-generating subalgebra T . 

Sketch of the Proof. By a classical result of Rokhlin ( I{o(i2| ) the sub-cr-algebra B$ = 
£>(JF) induces a partition V = {P t }tei of X by So-measurable sets such that every So- 
measurable set is equal (up to a set of measure zero) to a union of partition elements 
Pt- For every open set A G I we set A' = {J teA Pt- We define the measure a on cylinder 
sets of I z by a(f)™ = _ n {x: Xi E Ai}) = n(f)™ = _ n {x: T l x E A'j}), and then extended it 
to the whole sequence space using Kolmogorov's extension theorem. We can check that 
the advertised sequence space representation of X is determined by the measure a. 

2.5. Van der Corput's Lemma. The following classical lemma will be needed in the 
sequel f |FW96j . page 47): 

Lemma 2.5 (Van der Corput). Let {a; n } ne N be a bounded sequence of vectors in a 
Hilbert space. For each m we set 
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Then 



1 N 

lim — > 

n=l 



in the L 2 



norm. 



3. Strong stationarity 



3.1. Definitions. 

Definition 3.1. (i) A stationary process {fi}i<zz is strongly stationary if the finite di- 
mensional statistics of {fi}i^z and {f n i}iez are the same for every n G N. 

(ii) An invertible measure preserving system X is strongly stationary if there exists a 
T-generating function algebra T such that every process {T J /}iez is strongly stationary 
for / G T. When we want to also indicate the subalgebra T we write that (X, J 7 ) is 
strongly stationary. 

Remark. Equivalently, a system X is strongly stationary if there exists a T-generating 
function algebra T such that 



Let A be a sequence space system. The subalgebra JF of bounded x -measurable 
functions is T-generating. Strong stationarity with respect to JF is equivalent to saying 
that the maps r n defined by (r n x)i = x n i are measure preserving for every n G N. 

Definition 3.2. A measure a on the sequence space A z is strongly stationary if the 
system (A Z ,JF ) is strongly stationary. 

3.2. Stationary processes majorize strongly stationary ones. Let and 

{9i}i& be two A-valued stationary processes with underline measures /i, v correspond- 
ingly. We say that {fi} i& majorizes {^} ieZ if 



for every fceN and open sets Ai C A. 

Furstenberg and Katznelson f |FK91j ) proved that every stationary process majorizes 
a strongly stationary one. Actually they established a much more general result using 
a strong selection theorem. The argument given below was suggested by Y. Peres and 
gives an easier proof for the case that we are interested. 

Theorem 3.3 (Furstenberg and Katznelson). Every stationary process majorizes a 
strongly stationary one. 




for n 6 N, fi G T. 



k 



k 
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Proof. By Proposition 12.31 there exists a stationary measure a on the sequence space A z 
such that the processes {fi}iez and {xi}i e i have the same finite dimensional statistics. 
For n G N, let r n be the map defined on A z by (r n x)i = x n {. It is straightforward to 
check that the measure r n a defined by r n a(A) = a^r^A) is stationary. If we denote by 
O the closure in the w*-topology of the set of all convex combinations of the measures 
r n o~, 77, G N, then O is a compact convex subset of M. s . The maps r n commute and act 
continuously and affinely on O so by the Markov-Kakutani fixed point theorem ( |Co85j . 
page 151) they have a common fixed point v G O. 

We claim that the stationary process {xi} i£ z which is induced by the measure v on 
the sequence space A z is strongly stationary and is majorized by the process {fi}iez- 
Indeed, the invariance over each r n proves strong stationarity. Moreover, convex linear 
combinations of the measures r n a come arbitrarily close to v in the w*-topology. Hence, 
for every e > and choice of A^s there exists 77 G N such that 

k k 
<r( P| {Xin G Ai}) > u( P| {Xi G Ai}) - £. 

i=—k i=—k 

Since for every 77 G N we have cr(f] i= _ k {x in G A ; }) = ^(ft i= _ k {f in G A^) and 
f(r\i = _ k {xi G Ai}) = v(rii=-k{ x in £ Ai}) the result follows. □ 

We deduce now a similar result for measure preserving systems. 

Corollary 3.4. Let X = (X,B, fi,T) be any (not necessarily invertible) measure pre- 
serving system and A be a B-measurable set. Then there exists an invertible strongly 
stationary system X = (X ,B, jl,T) and B G B, such that fi(A) = jl(B) and 

k k 
sup n(P\T- zn A) > sup MC\f- m B) 

for every fceN. 

Proof. Let 1a denote the indicator function of the set A. Using a standard argument 
we extend the one sided stationary process {/i}j e N = {T^l^jjeN to a two sided one. 
We denote the two sided extension by {/j}j 6 z- By Theorem 13.31 the process {/i}j e z 
majorizes a strongly stationary one {^ijjgz- Following the proof of Theorem 13.31 we see 
that {gi}icz = {S l x } i€ z, where S is the (invertible) shift transformation on the sequence 
space {0, 1} Z with some appropriately chosen measure a. We let B = {x G {0, 1} Z : x = 
1} and set X = ({0, 1} Z , B z , a, S) . If JF is the subalgebra of bounded x -measurable 
functions then (X, JF ) is strongly stationary. The advertised inequality is valid since 
{/i}igz majorizes {gi}i^z- Finally, following again the proof of Theorem 13.31 we see that 
li(A) = a(B). □ 
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3.3. The r n 's and Jenvey's result. Let X be a strongly stationary system. The next 
proposition was proved in |,le97j and gives useful necessary and sufficient conditions for 
strong stationarity: 

Proposition 3.5 (Jenvey). The measure preserving system X is strongly stationary 
if and only if there exists a T-generating function algebra T , and a family of measure 
•preserving transformations {r n } n€N , that leave every function in T invariant and such 
that the operators T and T n satisfy the commutation relations 

(1) r n T = T n T„, neN. 

Moreover, we can choose the r n 's to satisfy T mn = T m T n , for all m, n G N. 

Remark. Equivalently, for the point transformations T and r n relation ((TJ can be written 
as (Tr n )(x) = (r n T n )(x), for a.e. x G X and n£ff. 

Using the multiple weak- mixing theorem ( |Fu81j . page 86) it is easy to see that if a 
strongly stationary system is weak-mixing then it is Bernoulli. In |Je97j Jenvey shows 
that the same conclusion holds if we just assume ergodicity. 

Theorem 3.6 (Jenvey). Every ergodic strongly stationary system is a Bernoulli system. 

We remark that strong stationarity of a system does not imply that of its ergodic 
components, so we cannot use this theorem to determine the structure of the general 
strongly stationary system. 

Applying the argument in the proof of Theorem 13.61 for the general (not necessarily 
ergodic) strongly stationary system we can deduce the following: 

Proposition 3.7. If X is a strongly stationary system and A is an eigenvalue ofT then 
A = 1. 

Since the proof is too long to reproduce we just indicate the strategy. Suppose that \ 
is a A-eigenfunction of T, A is not a root of unity, and that g G \/ k _ k T % T . We have 

1 N f 

X g dfx= lim — V, / T n \ r n g dfx. 

n=l J 

To show that the limit on the right is zero we use repeatedly Van der Corput's lemma 
fLemma 12.5(1 and relation As it turns out, it suffices to show that the sequence of 
functions u n = T~ n (T n + m x T nX) converges weakly to zero as n — > oo for every m G N. 
From relation $1} it follows that r n x is a linear combination of A 1,/n -eigenfunctions. When 
A is not a root of unity this easily gives that the sequence u n consists of "almost" pairwise 
orthogonal functions and hence converges weakly to zero. It follows that \ is orthogonal 
to the subalgebra \j ' _ k T % T for every k G N. Since the algebra T is T-generating we get 
that x = 0- The case where A is a nontrivial root of unity is trickier but the strategy of 
the proof is similar. 
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3.4. Examples of strongly stationary systems. We give now the basic examples of 
strongly stationary systems. 

(i) Exchangeable systems. A system is exchangeable if there exists a T-generating 
function algebra T which has finite dimensional statistics invariant under any permuta- 
tion of the time parameter. Bernoulli systems are exchangeable. A theorem of de Finetti 
says that a system is exchangeable if and only if it is a mixture of Bernoulli systems. We 
can check that exchangeability implies strong stationarity but the converse is not true 
as the next example shows. 

(ii) Ergodic components circle rotations. On the 2-torus, with group action addition 
mod 1 and the Haar measure, define T(x,y) = (x,y + x). To check that the system 
is strongly stationary we use Proposition 13.51 We let T be the algebra generated by 
the exponentials in y and define the maps r n by r n (x,y) = (nx,y). Since e iy G T and 
e ix _ rp e %y e -iy ^ ajggb^ jr j s T-generating. Moreover, r n is measure preserving, each 
r n leaves functions in T invariant, and 

Tr n (x, y) = (nx, y + nx) = r n T n (x, y). 

Hence, T n T = T n r n for n e N. 

(iii) Ergodic components affine transformations on T d . On T 3 , with group action 
addition (mod 1) and the Haar measure, define T(x, y, z) = (x,y + x, z + y). To see that 
the induced measure preserving system is strongly stationary we use again Proposition 
13.51 We let T be the algebra generated by exponentials in z and define the maps r„ 
by r n (x,y,z) = [n 2 x,ny + (™)x,z). We can check as before that the algebra T is T- 
generating. Each r„ clearly leave functions in T invariant and a direct computation 
shows that it satisfies the right commutation relations. 

More generally, on T d with group action addition (mod 1) and the Haar measure, 
define 

T(xi, x 2 ,...,x d ) = (x 1 ,x 2 + x 1 ,...,x d + x d _i). 
This time T is the algebra generated by exponentials in x& and the r n 's have the form 

r n (x 1 ,x 2 , ...,x d ) = {n d ~ 1 xi ) n d ~ 2 x 2 + Q^ 1 ' ' ' ^ nXd - 1 H h [d - ij Xl ' Xd )- 

(iv) Ergodic components affine maps on more general groups. In the last two examples 
T can be replaced by any connected compact abelian group. The form of the r n 's and 
of the algebra T is similar. The connectedness assumption is needed to guarantee that 
each r n is measure preserving. 

Although the previous examples provide an ample supply of strongly stationary sys- 
tems, the building blocks are always Bernoulli systems and affine transformations on 
compact abelian groups. In the last section we will see that this is not the case in 
general (Section l6.4| example (iv)). 
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4. EXTREMALITY 

4.1. Definition of extremality. Recall that by Proposition 12.41 every strongly sta- 
tionary system (X, JF) has a sequence space representation (I Z ,JF ), where / = [0,1] 
and T§ is the subalgebra of bounded xo-measurable functions. This representation is 
completely determined by the measure a on the sequence space, so characterizing (up 
to isomorphism) the strongly stationary measure preserving systems is equivalent to 
characterizing the strongly stationary measures on I z . Furthermore, we only have to 
determine the structure of the extremal ones, that is, those that cannot be decomposed 
nontrivially into a convex combination of strongly stationary measures. We will make 
this more precise below. 

Consider the space Ai ss of all strongly stationary measures. Then M. ss is a closed 
convex subset of the space of stationary measures M. s which is u>*-compact, metrizable, 
and locally convex. 

Definition 4.1. (i) Let o be a strongly stationary measure on J z . Then a is extremal 
if it cannot be written in the form a = aai + (1 — a)o~ 2 for some < a < 1 and strongly 
stationary measures cti ^ a 2 . 

(ii) Let (X, JF) be a strongly stationary system and a be the strongly stationary 
measure that determines its sequence space representation with respect to T. Then 
(X, JF) is extremal if a is an extremal strongly stationary measure. 

We will use the following integral representation theorem of Choquet ( |Ph01j ): 

Theorem 4.2 (Choquet). Let X be a metrizable compact convex subset of a locally 
convex space E and xq G X . Then there exists a Borel probability measure fi on X, 
supported on the extreme points ext(X), that satisfies xq = f ext , x \ xdfi(x) (that is, l(xo) = 
lext(x) K x )ds{ x ) holds for every I in the dual of X). 

It follows that the general strongly stationary measure or system is an integral of 
extremal ones. So we can focus our attention on determining the structure of the extremal 
strongly stationary measures or systems. 

4.2. Necessary and sufficient conditions. It is well known that the set of extremal 
points of the space Ai s is the set of ergodic measures (with respect to the shift transfor- 
mation S), and that different ergodic measures are mutually singular. It is not hard to 
establish the analogous results for the space of strongly stationary measures A4 SS . The 
corresponding action on A4 SS is the joint action of S and the r n 's {{j n x)i = x n i). 

Proposition 4.3. Different extremal strongly stationary measures are mutually singular. 

Proof. Let fii and fi 2 be two different extremal strongly stationary measures and let 5* 
denote the shift transformation on I z . Consider the Lebesgue decomposition of \i\ with 
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respect to /i 2 , that is, write H\ = av\ + (1 — a)v 2 where u x , v 2 are probability measures 
such that v\ _L fi 2 and v 2 {i 2 . Then 

Hi = Sfi\ = a Sv\ + (1 — a) Sv 2 , 
Hi = T ri fii = a r n v x + (1 - a) T n V 2 . 

The Lebesgue decomposition is unique, so both S and r n preserve ui, v 2 . This means 
that V\ and v 2 are both strongly stationary measures. Since [i\ is extremal we have 
either /ii = v\ or /ii = v 2 . If /ii = v\ then /ii and /i 2 are mutually singular. So it 
remains to show that /ii 7^ v 2 . Suppose on the contrary that /ii = v 2 . Then ji\ <C /i 2 , so 
fii = J fdfi 2 for some / G L 1 (/i 2 ). The choice of / is unique, so we conclude as before 
that / is S, r n -invariant (with respect to fj, 2 ). Since / is nonconstant (/^ 7^ /i 2 ) there 
exists a 5, r n -invariant set A such that < ^ 2 {A) < 1. Then /x 2 is a nontrivial convex 
combination of the induced strongly stationary measures on A and A c . This contradicts 
the extremality of [i 2 and completes the proof. □ 

Proposition 4.4. A strongly stationary measure a is extremal if and only if the joint 
action of S and the r n 's is ergodic. 

Proof Suppose that the joint action is not ergodic, that is, there exists a S, T„-invariant 
set A with < o~(A) < 1. Then a is a nontrivial convex combination of the induced 
strongly stationary measures on A and A c . Hence a is not extremal. 

Conversely, suppose that the joint action is ergodic. Let a = ao\ + (1 — a)a 2 , for some 
strongly stationary measures <7i, a 2 . Then o\ is absolutely continuous with respect to cr 
and the corresponding Radon-Nikodym derivative dajdo\ is a S, r n -invariant function. 
Since the joint action is ergodic, dojdo\ is constant /x-a.e.. Hence a = <j\. This proves 
that a is extremal. □ 

Remarks, (i) It follows that a strongly stationary system (X, T) is extremal if and only 
if the joint action of T and the r n 's (of Proposition IH.5jl is ergodic. 

(ii) Using this proposition we can easily check that the examples on T d given in Section 
13.41 are extremal. 

4.3. Homogeneity property. In this section we will show that the ergodic components 
of an extremal strongly stationary system enjoy a homogeneity property, in the sense 
that their structure is similar. 

Lemma 4.5. Let (X, T) be a strongly stationary system. Then r n leaves the sub-a- 
algebra of T -invariant sets invariant. 

Proof. Let A be a T-invariant set, that is T~ 1 A = A. The commutation relations of 
Proposition 13.51 give that 

t~ x A = r-'T-'A = T~ n T~ x A. 
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Hence, t~ 1 A is left invariant by T n . By Theorem 13.71 the transformation T does not 
have nontrivial roots of unity as eigenvalues. It follows that t~ 1 A is also T-invariant, 
completing the proof. □ 

Lemma 4.6. Let (X, T) be an extremal strongly stationary system. If A £ I has positive 
measure then [J neN T^ 1 (A) has full measure. 

Proof. Let B = [J neN t~ 1 (A). In view of Proposition 14.41 it suffices to show that B is 
T, x„-invariant. Since r mn = r m T n we have 

r~\B) = |J r-\-\A) = |J r m l{A) c B. 

neN neN 

So the set B is r m -invariant for every m G N. Moreover, since T~ X B is equal to 
[J neN T _1 r„" 1 (y4) and by Lemma f4. 51 the set t~ 1 (A) is T-invariant, B is also T-invariant. 

□ 

Definition 4.7. Let X be a measure preserving system with ergodic decomposition 
fi = J fit d\(t). We say that the sets A, B G X with positive A-measure are factor power 
equivalent (FPE), if for A-a.e. b G B there exists a G A and n G N such that (X, B, fib, T) 
is a factor of (X, £>, fi a , T n ), and vice versa. 

We are now ready to prove the advertised homogeneity property. 

Theorem 4.8. Let (X,JF) be an extremal strongly stationary system with ergodic de- 
composition fi = f fi t d\(t). Then any two sets A, B G X with positive measure are 
FPE. 

Proof. By Lemma 14.51 each r n permutes the ergodic fibers. Suppose that r n maps the 
fiber X a to the fiber (b = r n a). The pointwise commutation relations Tr n = r n T n 
show that (X, B, fib, T) is a factor of (X, B, fi a , T n ). By Lemma l4~o1 if Ael has positive 
measure then UneN r n~ 1 (^) nas ^ uu measure. It follows that A is FPE to X. So any two 
sets A,B^X with positive measure are FPE. □ 

We call a property "nice" if it is preserved by factors and powers of measure preserving 
systems. The homogeneity property just established allows us to extend "nice" properties 
from a nontrivial set of ergodic components to A-a.e. ergodic component. 

As an application, suppose that a nontrivial set (of positive A measure) of ergodic 
components of an extremal strongly stationary system (X, T) is weak mixing . We claim 
that it is a Bernoulli system. Indeed, weak mixing is a "nice" property, so Theorem 14.81 
gives that A-a.e. ergodic component of the system is weak mixing. Strong stationarity 
gives 

N 

1/oT/i- T k f k dfi = ^Yl I fo T "/i ■•• Tkn fx d »> 

J n=l J 
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for all k, N £ N, fi <E T. Let \i = J \i t d\(t) be the ergodic decomposition of /i. Letting 
N — > oo and using the multiple weak mixing theorem ( |Fu81j . page 86) we get 

J f T/i • • • T k f k dfi = J (j f dfH J hdiH ■■■ J fk dfj^j d\(t). 

It follows that the system is an integral of Bernoulli systems and being extremal it must 
be Bernoulli. 

5. Reduction to distal systems 

5.1. Characteristic factors. The notion of a characteristic factor was introduced by 
Furstenberg in order to facilitate the study of nonconventional ergodic averages. The idea 
is to find the smallest factor of a system that completely determines the limit behavior 
of these averages and then work with this simpler system. 

Definition 5.1. A factor Y = (Y, £ , v, T) of a system X is characteristic for k terms if 

N k k 

i^E(n T ^-n TinE (^))-° 



n=l i=l i=l 

in L 2 (X) for /< G L°°(X). 

Furstenberg (fFhTTj) proved for ergodic systems that the fc-step distal factor is char- 
acteristic for k + 1 terms. We want to use this result for general systems (not necessarily 
ergodic), so for completeness we include a proof that covers the general case. 

Theorem 5.2 (Furstenberg). Let X be a measure preserving system. Then the factor 
Djfc_i is characteristic for k-terms. 

Proof. We use induction on k. For k = 1 this is the context of the L 2 -ergodic theorem. 
Assume that the statement is valid for k, we will establish it for k + 1. It suffices to show 
that if one of the /,'s is orthogonal to T>k then 

TV fc+1 



lim — V TT T in fi = 



AT-voo jV 

n=l i=l 

in L 2 (X). Indeed, add and subtract K(fi\Vk) to every fi and expand the product. All 
the terms but the two that we are interested will converge to zero giving us the desired 
identity. 

So suppose that E(/i|Pjfe) = (the argument is similar if E(/j|Pfc) = for i ^ 1). We 
apply Van der Corput's lemma (|2.5J) on the Hilbert space £ 2 (X) with a n = Yli=i T m fi- 
In order to show that 

N 



1 

lim — > a n = 



N^oo N 

n=l 
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in L 2 (X) it suffices to establish that 



(2) liniM^oo— ^ 6 m = 0, 

m=l 

where 

1 at ^ N « k 

b m =K^ N ^ oc \-J2 < a n+m,a n > | = En" A r_ K30 |-J] / n T "( T(i+1)m /^0 ^ 

n=l n=l ^ i=0 

By the induction hypothesis the last limit is equal to 

N k 

E^-iivE / n Ti " E ( T(m)m ^^i^- 1 ) ^ • 

n=l i=0 

Now we make use of the fact that E(/i|X>fc) = 0. It follows from Lemma 12.21 that the 
function g(x\,X2) = /i(^i)/i(^2) is orthogonal to the space of invariant functions of 
X Xo k _ 1 X. If S = T x T, applying the L 2 -ergodic theorem for the system X Xn k ,X 
we get 

M M 
m=l m=l 

Since every is bounded the Cauchy-Schwartz inequality gives 

k 

| f JJr^T^^/i/ilP*.!) dfi\<L f \E(T m hh\V k „ x )fdii 
J i=o ^ 
for some number L that is independent of n. Hence, 

6 m < L y |E(T"/i/i|2Vi)| 2 d/i. 

From this and (j3J) it follows that the limit in (J2J) is 0. This completes the induction. □ 

Corollary 5.3. The distal factor of a system is a characteristic factor for k-terms for 
every fceN. 

5.2. Relative Bernoulli extensions. The notion of a relative Bernoulli extension was 
introduced in |Th75bj . 

Definition 5.4. Let X be an ergodic system and Y be a factor of X. Then X is a 
relative Bernoulli extension of Y if X is isomorphic to the direct product of a Bernoulli 
system and Y. 

Let {X,B,fi) be a measure space. Two finite i3-measurable partitions "P — {Pi}i=i 
and Q = {Qi}\ =1 of X are e -independent if 
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Let X be an invertible and ergodic system, Y be a factor of X, and \l — J n y dv(y) be 
the disintegration of \i over Y. A sequence of finite partitions {"Pj}j G z is weak Bernoulli 
relative to Y if for u-a,.e. y the following is true: for given e > there exists AfeN such 
that for every m > 1 

N+m 

y Vi is e— independent of \l V{. 

-m N 

with respect to fi y . 

The following theorem is a consequence of the results of the articles |Th75aj (Propo- 
sitions 3, 4, and 5) and |Th75bj (L emma 6). One can also deduce this from Theorem 2 
in jKi84j . 

Theorem 5.5 (Thouvenot). Lei X be an invertible, ergodic system, andY be a factor 
of X. Suppose that for some finite T -generating partition V the sequence of partitions 
\T % V}i£% is weak Bernoulli relative to Y. Then X is a relative Bernoulli extension of 
Y. 

Note that the relative notion of weak Bernoulli is a stronger property than the relative 
notion of very weak Bernoulli that was used in |Th75bj . 

5.3. The relative Bernoulli property. 

Lemma 5.6. Let (X, T) be a strongly stationary system and {r n } ng N be the maps defined 
in Proposition \3.b\ Then the spaces L 2 (D^) and L 2 (D) are r n -invariant for every n G N. 

Proof. LetnGN. Bv Lemma l4.5l the subspace L 2 (Dn) is r„-invariant. From the definition 
of D/c and D it suffices to show that every maximal compact extension of a r n -invariant 
space is also r n -invariant. So suppose that X is a maximal compact extension of Y and 
that L 2 (Y) is r n -invariant. Consider the disintegration fi = J [i y dv{y) of /i over Y. It 
suffices to show that if / is compact relative to Y then so is r n f. Let e > 0. There exists 
a finite set of functions gi, . . . ,g m such that 

mm || TV -g s \\ L 2 { } < e 

l<s<m " 

for every isN, and z/-a.e. y G Y. Write i = i'n + r, for some i' G N and < r < n — 1. 
Using the commutation relations of Proposition 13.51 we get 

f(r n Tx) = f(T'r n T r x). 

So 

\\T\r n f) -T r (r n g)\\ LHlly) = \\f{T i 'r n T r x)-g{T n T r x)\\ L ^ y) = ||T*7 - g\\ m , yl) , 

where y' = r n T r y. The last equality is valid since L 2 (Y) is invariant under both r n and 
T. It follows that the set of functions {T r (T n g s ), 1 < s < m, < r < n — 1} is fiberwise 
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a finite e-net relative to Y for the orbit {T*(r ri /)} igN . This shows that the function T n f 
is compact relative to Y and completes the proof. □ 

Remark. Since we are only going to use the r n -invariance of L 2 (D) we can avoid the 
use of Lemma 14.51 Indeed, the distal factor can be exhausted by a sequence (possibly 
transfinite) of maximal isometric extensions starting from the trivial factor (determined 
by the algebra of constant functions). We can then use the step by step argument of the 
previous proof to show that L 2 (D) is r n -invariant. 

Theorem 5.7. Let (X, JF) be a strongly stationary system. Then its distal factor is 
strongly stationary and almost every ergodic component of X is a relative Bernoulli 
extension of its distal factor. 

Proof. Step 1. Let D = (D,T>, u, T) be the distal factor of X. Strong stationarity gives 

/oT/i - T k f h dfi= [ / T7i- T kn f k dfi 



for all k, n G N, fi G T . Averaging over n and taking the limit as iV — > oo gives 
/ / T/, ••• T k f k dfi = jim 1 J] f / n ••• T kn f k d^. 

J n=l 



By Theorem 15.21 the last average is equal to 



i - r 

( 4 ) i im m E / E (^l^) T n E(A|2>) ■ ■ ■ T kn E(f k \V) dv. 

n=l J 

The maps {r n } ng N leave the functions in T invariant, as well as the space L 2 (D) (by 
Lemma f5.6p . hence 

(5) r n E(fi\V) = E(r n fi\V) = E(/i|P). 

Since T n T = T n r n and r n is measure preserving for n G N, we get using © that all the 
integrals in (H) are equal to 



E(/ |P)TE(/ 1 |I>) ... T fe E(/ fc |P)^. 



This shows that 



(6) J /oT/i ••• T fc / fc rf/i = 1 E(/ |P)TE(/!|P) ••• T fe E(/ fc |I?) dz/ 

for jfe G N, fi G JF. 

Step 2. We will strengthen © to a fiberwise relation and prove the first claim. Call 
Tt> the algebra generated by functions of the form E(/|X>), where / G T. Let V be the 
sub-u-algebra of V^o T l J^-measurable sets. Clearly V C V. Since V is T-invariant 
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it induces a factor D' = (D',J) > ,p',T) of X. Applying r n to the left hand side of the 
equation below and using the previous averaging technique we get as before that 



(7) 



km „ k m 

i=0 j=0 J i=0 j=0 



for k, m G N, f\ G J 7 , gi G T-v- Observe that E(/|P) is D'-measurable for / G J 7 , so V 
can be replaced by T>' in (|7J). Moreover, all the g^s are P'-measurable, so (J7|) takes the 
following form 

/k k m 

[e( J]T7^') - J^jT i E(/ i |X>')j Y[T 9j dv' = 0. 
i=0 i=0 j=0 

Since y^t^T 1 F-p * s dense in L 2 (D') we get 

I ^(JJt^ip') - JJrE^lP')] <7efc/ = 

for every (7 G L 2 (D'). This can only happen if 

k k 

(9) E^HTMV'Yy) = Y[TE(f t \V>)(y) 



i=0 i=0 



for z/-a.e. ?/ G D'. 

Next we claim that T>' = T>. Relation © easily implies that X is a relative weak mixing 
extension of D'. Since D is the minimal factor with respect to which X is relative weak 
mixing, T> must be contained in V . Thus, D' = T> and takes the form 

k k 

(10) E(j]T*/ i |2?)(y) = \\^E{n\D){y) 

i=0 i=0 

for v-a..e. y G D. 

Since every / G T-p is r n -invariant and T-& is a T-generating algebra for £>, the system 
D is strongly stationary. 

Step 3. We will now prove the second claim. First assume that the sub-a-algebra 
is determined by a finite partition V. Then V is T-generating for almost every 
ergodic component and relation (fTT)|) is valid for almost every ergodic component, pro- 
vided that we replace T> with the distal factor of the corresponding ergodic component. 
To simplify the notation we assume that X is ergodic, and we keep in mind that any 
result we get will be valid for almost every ergodic component. 

We claim that the sequence of partitions {T l V}i^z satisfies the conditions of Theorem 
15.51 To see this observe first that if we replace k with 2k in (flO|) and then apply T~ h we 
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get 

k k 

(11) E( J] Tf t \v)(y) = ft TE(MV)(y) 

i=—k i=—k 

for every fceN, u-a.e. y &Y, and P-measurable functions /j. Hence, 



J f9 dn v = J f dfx y J g d/j y 



for every k G N, and u-a.e. y, whenever / is \/° fe T*P-measurable and g is \/? T J, P- 
measurable. So {T" l, P}j e z is weak Bernoulli with respect to \i y for u-a.e. y. Theorem 15.51 
now implies that the system is a relative Bernoulli extension of its distal factor. Hence, 
almost every ergodic component is isomorphic to the direct product of its distal factor 
and a Bernoulli system. 

In general, since L 2 (X) is separable there exists a sequence {V n } n <=N of finite T- 
measurable partitions such that V n+ i refines V n and \l°^ =x V n = B{J-). Applying the 
previous argument for the factor that is T-generated by V n we get a factor X n which 
is isomorphic to the direct product of a distal system and a Bernoulli system. Since 
increasing unions of distal systems is distal and of Bernoulli systems Bernoulli ( |Ur74j . 
page 52), it follows that almost every ergodic component of X is isomorphic to the direct 
product of a distal system and a Bernoulli system. This completes the proof. □ 

5.4. The distal factor coincides with the C-factor. Let X be a measure preserving 
system. For fixed k G N consider the closed subalgebra generated by weak limits (in 
L 2 (X)) of the form 

1 Nl 

(12) wlim — V T^ n j\ T^/2 • ■ • T*"/*, 

1— »oo iVi ^— -" ' 

1 71=1 

where we are free to choose any ai, . . . , a k G N, functions fi, ■ ■ ■ , fk £ L°°(X.), and an 
increasing sequence {Ni}^ of positive integers that guarantees weak convergence. This 
algebra is conjugation closed and T- invariant so it gives rise to a factor If in addition 
we are free to choose any k G N we get a factor that extends every C k . We denote it by 
C and call it the C-factor or characteristic factor of X. Using the notation of Section 
15.11 the factor is characteristic for k terms and the factor C is characteristic for any 
number of terms. Moreover, by Theorem 15.21 we have C k C D^_i for every k G N and 
so C C D. 

First, we prove that the C-factor is r n - invariant for every n G N. 

Lemma 5.8. Let (X,JF) be a strongly stationary system and {T m } mg ^ be the maps 
defined in Proposition ^. 51 Then for all k,m G N the subspaces L 2 {C k ) and L 2 (C) are 
r m -invariant. 
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Proof. Let k,m G N. Using the commutation relations r m T" = T nm T m (see Proposition 
13. 5|) we see that the operator T m maps functions of the form (|T2|) to functions of the 
form 

1 Nl 

wlim-V T ai " m (r m A) T a2 " m (r m / 2 ) ■■• T a * nm (r m f k ), 

l— >oo iV; 

belongs again to L 2 (Cfc). Since functions of the form (|T2j) generate L 2 (C k ) the subspace 
L 2 (Cfc) is r m -invariant. A similar argument applies for L 2 (C). □ 

Theorem 5.9. Every zero entropy (and hence every distal) strongly stationary system 
coincides with its C-factor. 

Proof. Since C is T-invariant, and T is T-generating, it suffices to show that every / G T 
is C-measurable. Equivalently, if / G T and g = f— E(/|C), we need to show that g = 0. 
By Lemma f5.8l the subspace £ 2 (C) is r n -invariant. Moreover, / is r n -invariant so g is r n - 
invariant for n G N. Given functions fi, ■ ■ ■ , fk € J 7 , there exists an increasing sequence 
{iV;}; e N of positive integers such that the sequence 

1 Nl 

— T "/i • • • T kn f k 

1 n=l 

converges weakly in L 2 (X) as m — > oo. Since r^T = T n T n , the functions /j and ^ are 
r n -invariant, and r n is measure preserving for n G N we get 

, 1 A /• 

/ jT/i • ■ ■ T fc /fe dA* = — ^ / 9 T n /i • • ■ T kn f k dfi 

J 1 n =l J 

for k e N, fi E J 7 . Since E(gr|C) = 0, we have K(g\C k ) = and by the definition of C k the 
last average is 0. Thus, g is orthogonal to the closed subalgebra spanned by bounded 
y oo j>* jr_ measura ^ e functions. Since X has zero entropy we have T % T = V/^°oo T 1 ^. 
It follows that g is orthogonal to the full algebra of the system. Hence g = 0, proving 
our claim. □ 

6. Distal strongly stationary systems 

6.1. Nilrotations. Let G be a locally compact and separable Lie group. The commu- 
tator of two elements g, h G G is the element [g, h] = g~ 1 h~ 1 gh. If A, B C G we write 
[A,B] for the subgroup generated by {[a, b}: a G A, b G £>}. The lower central series of 
G is defined as follows, G<°> = C7, C7« = [G, The group G is nilpotent of order 

k if G^- 1 = {e}, where e denotes the identity element of G. With G we denote the 
connected component of the identity element of G. If F is a discrete subgroup (not nec- 
essarily normal) of an order k nilpotent group such that G/Y is compact we call G/Y an 
order nilmanifold. The group G acts on G/Y by left translation T a (xr) = (ax)Y. There 
exists a unique probability measure on G/Y that is invariant under left translations, we 



20 



NIKOS FRANTZIKINAKIS 



denote it by \x and call it the Haar measure on G/T . If G is nilpotent of order k, we call 
the system G/T = (G/T,Q/T,T a , fi) an order k nilsystem and the transformation T a an 
order k nilrotation. An inverse limit of (order k) nilsystems (nilmanifolds) is called an 
(order k) pro-nilsystem (pro-nilmanifold) . 

The following generalization of a theorem of Parry QPa69j) was proved by Leibman 
|Lei03j . It was originally established by Lesigne |Les91j under an extra hypothesis. 
Bergelson and Host ([BH03J) showed that this extra hypothesis can be removed, thus 
providing another independent proof. 

Theorem 6.1 (Leibman). Let (G/T,T a ) be a nilsystem and set Z = G/G^T. If G is 

spanned by the connected component Go and a then 

(i) The nilsystem (G/T, T a ) is uniquely ergodic if and only if the factor system (Z, T a ) 
is ergodic. 

(ii) If (G/T, Tg) is ergodic then its Kronecker factor is (Z,T a ). 

Remark. As it was noted in |BH03j if the nilsystem (G/T,T a ) is ergodic then the pro- 
jection of < Go, a > on G/T being an open invariant set is equal to G/T. Hence, 
G/T =< Go, a > / (rn < Go? a >). So if T a is ergodic we can assume that G =< Go, a >. 

Examples, (i) The prototypical example of a nonabelian order two ergodic nilsystem 
is defined on the Heisenberg nilmanifold. Let 

f fl xi x 3 \ ] r fl fci fc 3 \ 

G = <Mo i X2 j , Xi e m| , r = j f o i k 2 j,kiEZ 

where ai,a2 G R are rationally independent and 03 G K. Then G with the standard 
metric is locally compact and connected. Moreover, if the group action is matrix multi- 
plication then G is nilpotent of order two and G/T is compact. So T a defines a nilsystem 
on G/T. Observe that G/G^'T ~ T 2 and that the rotation on T 2 by (01,02) is ergodic. 
It follows from Theorem Ifi . II that (G/T,T a ) is uniquely ergodic and its Kronecker factor 
is induced by the functions on xi,X2- 
(ii) Let 







where b is irrational. Then (G/T,T a ) is uniquely ergodic and isomorphic to the affine 
system on T 2 with the Haar measure induced by T(x, y) = (x + b, y + x). 

It turns out that every measure preserving system that is induced by some distal 
affine transformation on compact abelian group with the Haar measure is isomorphic to 
a nilsystem. But not every nilsystem is isomorphic to an affine system. For example the 
order two nilsystem of example (i) is not ( |Fu90| . page 52). 
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6.2. Nonconventional ergodic averages. The following theorem of Host and Kra 
( |HK0 3j) is crucial for our study. 

Theorem 6.2 (Host and Kra). Let X be an invertible ergodic measure preserving 
system. Then the averages 

1 - 

lim — V T n /i T 2n f 2 ■ ■ ■ T kn f k 

n=l 

converge in L 2 (X.) for fa G L°°(X). Moreover, the characteristic factor (defined in 
Section \5.4\) for these averages is an order k — 1 pro-nilsystem. 

If T is an ergodic order k nilrotation it is possible to find an explicit formula for the 
limit. This was done for G connected and k = 3 by Lesigne QLcs89j) and for general 
(not necessarily connected) G and k by Ziegler ([Zi03j. To describe the formula it is 
convenient to first establish some notation. 

Let G/r be an order / ergodic nilsystem. It turns out ( |Lei02j ) that for every fcGN 
the set 

H k = [(x u x\x 2 , xPxP-- - xP), liGG' 1 - 1 '} 

is a closed subgroup of G X • • • x G (the product has k terms) with group action coordi- 
natewise multiplication. If A k = H k PI (T x • • • x T), then the quotient H^/A^ is again a 
nilmanifold and supports a unique left invariant (under translations by elements in H k ) 
measure that we denote by vn k - 

Theorem 6.3 (Ziegler). Let (G/T,T a ) be an order I ergodic nilsystem (assume that 
G =< Go, a >). If fx, . . . , fk G L°°{G/Y) then for almost every x G G we have 

1 - 

n=l 

/ h{xyiY) ■■■ f k (xy k T) dv Hk (yA k ) 

JH k /A k 

where y = {y 1: ...,y k ). 

The next identity will enable us later to give a general method for constructing strongly 
stationary systems starting from totally ergodic measure preserving systems. It is a 
consequence of Theorems 16.21 and 16.31 

Theorem 6.4. Let X be an ergodic measure preserving system such that T r is ergodic 
for some r G N. Then 

N N 

( 13 ) i im /» T ^- Thn f* ^ = i im mJ2 f° r /i • • • Thrn f* ^ 

n=l J n=l J 

forfieL°°(X). 
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Proof. Assume first that T a is an order I nilrotation defined on G/Y. Since T£ is ergodic 
T a is also ergodic, so Theorem 16.31 gives that 

JV 



! im Tt E / ^{xY) T a/i(* r ) • • • T?f k (xr) d^(xY) = 

f (xV)fi(xy 1 V) ■■■ f k (xy k Y) du Hk (yA k ) dfi(xT), 



N 



'G/rJH k /A k 

where fa G L°°(N/T), and H k , A k , Ujj h are defined as before. Observe that the integral 
on the right does not depend on the nilrotation T a on G/Y as long as it is ergodic. Since 
T ra = is assumed to be ergodic we get that 

N 



1 - f 

J™ M E / fo T raflW ■ ■ ■ T k r n J k (xY) d»(xY) - 

i - r 

J im m E / fo XThW • ■ ■ T rkn f k (xY) d»{xY). 
^-+°°a r~f Jg/t 



>G/r 

So (fTHJ) holds when T is a nilrotation. A standard approximation argument shows that 
(fTH|) holds when T defines a pro-nilsystem. In the general case we have 

1 N r 

J im ^ E / h Tn h • ■ • T " n ^ ^ = 



i im ^E / E ^\C) T n E(/!|C) ••• T kn E(f k \C) dp, 

N^oo iv ^ ' _/ 



n=l 

where C is the characteristic factor of the system. It follows from Theorem 16.21 that the 
factor C is a pro-nilsystem, so the limit on the right is equal to 

N 

i im ^E / E ^ TrnE (h\C) ■■■ T rkn E(f k \C) dii. 

n=l 17 

Since the factor C is characteristic for all terms the last limit is equal to 

N 



/v^SoArE / /o T rn /i ■■■ T rkn f k dfi. 

n=l J 



N- 

The result follows. □ 

We remark that identity (|13|) was proved for k = 3 by Host and Kra QHK02 ). It was 
also shown there how it implies an odd version of the ergodic Szemeredi theorem. For 
general k we get: 
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Corollary 6.5. Let X be a measure preserving system. Assume that T r is ergodic for 
some r e N and let A be a measurable set with fi(A) > 0. Then for every < j < r we 
have 



for some n = j (mod r) . 

The odd Szemeredi theorem corresponds to the case r = 2, j = 1. 

6.3. The structure of the ergodic components. Theorem 16 . 21 combined with Theo- 
rems and EIHl enables us to determine the structure of the ergodic components of the 
general strongly stationary system. 

Theorem 6.6. Almost every ergodic component of a strongly stationary system is iso- 
morphic to the direct product of a Bernoulli system and a totally ergodic pro-nilsystem. 

Proof. Theorem 15. 71 shows that almost every ergodic component of a strongly stationary 
system is the direct product of a Bernoulli system and a distal strongly stationary sys- 
tem. By Theorem 15 .91 a distal strongly stationary system coincides with its C-factor, so 
Theorem 16.21 shows that almost every ergodic component of the distal factor of the sys- 
tem is a pro-nilsystem. Finally, by Proposition 13. 71 the system has no rational eigenvalue 
different than 1, so a nontrivial set of ergodic components cannot share the same ratio- 
nal eigenvalue provided that it is different than 1. It follows that almost every ergodic 
component of the system is totally ergodic, which completes the proof. □ 

We will discuss the structure of the global system in more detail in the next section. 
We will see that there exist distal strongly systems with nonaffine ergodic components. 
This new class of examples will allow a complete classification. 

6.4. New examples and structure theorem. Let a be a stationary measure on the 
sequence space I z . We define a new measure a av on J z by averaging the statistics of 
a along arithmetic progressions. More precisely if JF is the algebra of bounded xq- 
measurable functions and S denotes the shift transformation, we define the measure a av 
on cylinder sets by 



for fi e (the limit exists by Theorem 16 ,2jl . We then extend o~ av to the whole sequence 
space. 



n(AnT- n An---nT~ kn A) > o 



(14) 



/o S/x • • • S k f k da av = lim 1 V [ f S"/i • • • ^ kn f k da, 




Theorem 6.7. Let a be a totally ergodic stationary measure. Then the measure a av is 
strongly stationary. 
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Proof. A direct computation proves stationarity. Since S is totally ergodic by Theorem 
16.41 we have 

N N 

,! im m E / fo ■ • • S " n ^ da = i im vE a srn/i • ■ ■ srfcn/fc rfa 

n=l ^ n=l ^ 

for all r,k eN, fi e JF . Hence 

/o S/i ■ • • S fe /fc ^crai) = J fo S r /i • • ■ S rk f k da av 

for all r,k e N, fi e T§. Since the subalgebra Tq is S-generating the measure a av is 
strongly stationary. □ 

We remark that under the hypothesis of the previous theorem the strongly stationary 
measure o~ av can be shown to be extremal. We will not use this fact so we omit its proof. 

Proposition 6.8. Let o be an extremal strongly stationary measure with ergodic decom- 
position a = f o~t d\(t). Then for A almost every t we have a = (cr t ) 



lav ■ 



Proof. Strong stationarity gives 

JV 



//oS/i ••• S*/*d*= Mm 1 £ I foS n fi ... s kn f k da 

J n=l ^ 

= | (jim JfYtffa S "/i • ■ ■ S fe "/fc da t ) d\(t) 



for all fc G N, fi E T Q . It follows that 

a = I {°t)av d\(t). 



By Theorem 13 . 71 almost every ergodic component of S is totally ergodic so Theorem 16.71 
gives that the measures (<Jt) a v are strongly stationary for A-a.e. t. Since a is extremal 
we have that a = (o~ t ) av for A-a.e. t. □ 

We will use Theorem 16. 71 to construct an ample supply of strongly stationary systems. 
We briefly describe the strategy. Starting with an arbitrary invertible totally ergodic 
measure preserving system X we first consider its sequence space representation with 
respect to T = L°°(X) (Proposition 12. 4j) . This representation is determined by a sta- 
tionary measure a on J z , so X is isomorphic to the system B z , a, S) where S is the 
shift transformation. Let <p: X — > I z be the isomorphism = T§ = Xo-measurable 

functions). We construct the strongly stationary measure a av as in (JEJ), that is, we 
define the measure a av on cylinder sets by 

N 

(15) / f S/i • • ■ S k f k da av = Jim 1 £ I & T n f[ • • ■ T kn f k dp, 

J n=l ^ 
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where f G JF and f! — fa o <p e T . Finally, we give an explicit description of the 
statistics of o~ av . This way we recover all the examples mentioned in Section l3~H and we 
also construct some new ones. 

Examples, (i) Suppose that X is a weak mixing system. Using the multiple weak 
mixing theorem ( |Fu81j . page 86) we can check that the resulting strongly stationary 
measure defined by (JT5|l is a Bernoulli measure. 

(ii) Suppose that X is the system induced by an ergodic rotation on T with the Haar 
measure m. If we compute the limit in (JT5j) we find that 

J fo S/i • • • S k fk da av = J ^ fo(y) f[{y + x) ■■■ f' k (y + kx) dm(y) dm(x). 

We can check that a av determines the sequence space representation of the strongly 
stationary system T(x,y) = (x,y + x) on T 2 with the Haar measure m (with respect to 
the algebra generated by the exponentials in y). 

(iii) Suppose that X is the system induced by an affine transformation on T 2 with the 
Haar measure defined by T(x,y) = (x + a, y + x), where a is irrational. We can check 
that the resulting strongly stationary measure defined by (j!5)) determines the sequence 
space representation of the system T'(x,y, z) = (x,y + x, z + y) on T 3 with the Haar 
measure (with respect to the algebra generated by the exponentials in z). 

(iv) Suppose that X is an order I totally ergodic nilsystem defined on X = G/T. Using 
Theorem 16 .HI we see that the resulting strongly stationary measure a av constructed by 
fT5|) is defined on cylinder sets as follows 

(16) / f S/i ••■ S k f k da av = / / fo(xT)f[(xyiT) ■■■ f k (xy k T) du Hlt dn, 

J J G/r JH k /A k 

where H k , A k , u Hk are defined as in Theorem 16.31 Then the system (J z , £> z , a av , S) is 
strongly stationary with respect to T§. 

(v) Suppose that X is a totally ergodic pro-nilsystem defined on the inverse limit X 
of the nilmanifolds X^ (X has to be connected since it supports a totally ergodic pro- 
nilsystem). Then the resulting strongly stationary measure a av constructed by (|T5j) is 
defined on 0pQ) by (|16j). Since I z = Uj0(-^i) ^is uniquely determines the measure 
a av on J z . Observe that the resulting strongly stationary system (J z , B z , a av , S) depends 
only on the pro-nilmanifold X. We call it the strongly stationary system associated to 
the pro-nilmanifold X. 

We remark that in example (iv) if X is the Heisenberg nilmanifold (see Section 6.1 
example (i)) then the ergodic components of the strongly stationary system obtained are 
nilrotations on X, and hence nonaffine ( (Fu90j . page 52). 

This new set of examples enables us to completely determine the structure of the 
general strongly stationary system. This is the context of our main theorem: 
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Theorem 6.9. Every extremal strongly stationary system (X,JF) is isomorphic to the 
direct product of a Bernoulli system and a strongly stationary system associated to some 
pro-nilmanifold. 

Proof. Suppose that the ergodic components of X are the systems X t . Let a, a t be the 
measures that determine the sequence space representations of X and X t with respect 
to L°°(X) and L°°(X t ). Then a = J a t dX is the ergodic decomposition of a. Since 
a is an extremal strongly stationary measure by Proposition 16.81 we have o = (a t ) av 
for A-a.e. t. Theorem 16.61 implies that such a measure a t has the form p x r where p 
and r determine the sequence space representation of a Bernoulli system and a totally 
ergodic pro-nilsystem N correspondingly. Then a = (p x r) av = p av x r av . The measure 
p av induces a Bernoulli system B (example (i)) and the measure r av induces a strongly 
stationary system P associated to the pro-nilmanifold N (example (v)). Hence, X is 
isomorphic to the direct product B x P and the result follows. □ 
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